NOVEL CYTOCHROME P450 MONOOXYGENASES AND THEIR
USE FOR OXIDIZING ORGANIC COMPOUNDS

The present invention relates to novel cytochrome P450 
monooxygenases with modified substrate specificity which are 
capable of the oxidation of organic substrates, for example 
N-heterocyclic aromatic compounds, nucleotide sequences coding 
therefor, expression constructs and vectors comprising these 
sequences, microorganisms transformed therewith, processes for 
the microbiological oxidation of various organic substrates, such 
as N-heterocyclic aromatic compounds and in particular processes 
for the preparation of indigo and indirubin. 

Enzymes having novel functions and properties can be prepared 
either by screening of natural samples or by protein engineering 
of known enzymes. Under certain circumstances, the last-mentioned
method can be the more suitable for inducing characteristics whose
generation by the natural selection route is improbable. Despite
numerous attempts at the engineering of enzymes, up to now there
are only a few successful studies for promoting the catalytic
activity of enzyme mutants with respect to a certain substrate
(1-10). In these known cases, the substrates are structurally
closely related to the native substrate of the respective enzyme.
As yet, there are no reports on the successful engineering of
enzymes which, after modification, catalyze the reaction of a 
compound which structurally is completely different from the 
native substrate of the enzyme. 

The cytochrome P450 monooxygenase isolatable from the bacterium 
Bacillus megaterium usually catalyzes the subterminal
hydroxylation of long-chain, saturated acids and the
corresponding amides and alcohols thereof or the epoxidation of 
unsaturated long-chain fatty acids or saturated fatty acids of 
medium chain length (11-13). The optimal chain length of 
saturated fatty acids is 14 to 16 carbon atoms. Fatty acids 
having a chain length of less than 12 are not hydroxylated (11). 

The structure of the heme domain of P450 BM-3 was determined by
X-ray structural analysis (14-16). The substrate binding site is
present in the form of a long tunnel-like opening which extends
from the surface of the molecule as far as the heme molecule and
is almost exclusively bordered by hydrophobic amino acid
residues. The only charged residues on the surface of the heme
domain are the residues Arg47 and Tyr51. It is assumed that these
are involved in the binding of the carboxylate group of the
substrate by formation of a hydrogen bond (14). The mutation of
Arg47 to Glu brings about inactivation of the enzyme for
arachidonic acid (13), but increases its activity toward
C12-C14-alkyltrimethylammonium compounds (17). Substrate
utilization for aromatic compounds, in particular mono-, bi- or
polynuclear, if desired heterocyclic, aromatics, alkanes,
alkenes, cycloalkanes and cycloalkenes, has not been described
for this enzyme. Until now, it was therefore assumed in
specialist circles that substrates other than the organic
substrates hitherto described, for example indole, are not
substrates of P450 BM-3 on account of their clear structural
differences from its native substrates, in particular on account
of the absence of functional groups which could bind to the
abovementioned residues in the substrate pocket.

It is an object of the present invention to make available novel
cytochrome P450 monooxygenases having modified substrate
specificity or modified substrate profile. In particular,
monooxygenase mutants are to be provided which, in comparison
with the nonmutated wild-type enzyme, are enzymatically active
with structurally clearly different substrates.

Compared to the wild-type enzyme, a "modified substrate profile"
can be observed for the mutants according to the invention. In
particular, for the mutant in question, an improvement in
reactivity is observed, for example an increase of the specific
activity (expressed as nmol of converted substrate/minute/nmol of
P450 enzyme) and/or of at least one kinetic parameter selected
from the group consisting of Kcat, Km and Kcat/Km (for example by
at least 1%, such as 10 to 1000%, 10 to 500% or 10 to 100%) in
the conversion of at least one of the oxidizable compounds
defined in groups a) to d). The oxidation reaction according to
the invention comprises the enzyme-catalyzed oxygenation of at
least one exogenous (i.e. added to the reaction medium) or
endogenous (i.e. already present in the reaction medium) organic
substrate. In particular, the oxidation reaction according to the
invention comprises a mono- and/or polyhydroxylation, for example
a mono- and/or dihydroxylation, at an aliphatic or aromatic C-H
group, or an epoxidation at a C=C group which is preferably
non-aromatic. Also possible are combinations of the above
reactions. Moreover, the immediate reaction product can be
converted further in the context of a non-enzymatic subsequent or
side reaction. Such combinations of enzymatic and non-enzymatic
processes likewise form part of the subject-matter of the present
invention.
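The measures named above (specific activity in nmol of converted substrate per minute per nmol of P450, and the ratio Kcat/Km) can be illustrated with a minimal Python sketch. The function names and all numerical values are hypothetical and serve only to show the arithmetic; they are not data from the specification.

```python
def specific_activity(nmol_converted: float, minutes: float, nmol_p450: float) -> float:
    """Specific activity in nmol converted substrate / minute / nmol P450."""
    if minutes <= 0 or nmol_p450 <= 0:
        raise ValueError("reaction time and enzyme amount must be positive")
    return nmol_converted / minutes / nmol_p450


def catalytic_efficiency(kcat: float, km: float) -> float:
    """Ratio Kcat/Km; the units follow from the units of the inputs."""
    if km <= 0:
        raise ValueError("Km must be positive")
    return kcat / km


def percent_increase(mutant: float, wild_type: float) -> float:
    """Relative improvement of the mutant over the wild type, in percent."""
    if wild_type <= 0:
        # No percentage can be formed if the wild type shows no activity.
        raise ValueError("wild-type value must be positive")
    return (mutant - wild_type) / wild_type * 100.0


# Hypothetical example values, for illustration only.
wild_type_activity = specific_activity(200.0, 10.0, 2.0)   # 10.0
mutant_activity = specific_activity(300.0, 10.0, 2.0)      # 15.0
print(percent_increase(mutant_activity, wild_type_activity))
```

Where the wild-type enzyme shows no measurable conversion of a substrate, a percentage increase is undefined, and the sketch raises an error rather than returning a number.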
We have found that the above object is surprisingly achieved by
means of novel cytochrome P450 monooxygenases which, for example,
are capable of the oxidation of N-heterocyclic bi- or polynuclear
aromatic compounds.

In particular, the invention relates to those monooxygenases
whose substrate-binding region has been rendered capable, by
means of site-specific mutagenesis, of the functional uptake of
novel, for example N-heterocyclic, substrates.

In a preferred embodiment of the invention, the novel
monooxygenases are soluble, i.e. existent in non membrane-bound 
form, and enzymatically active in this form. 

The monooxygenases according to the invention are preferably
derived from cytochrome P450 monooxygenases of bacterial origin,
in particular from cytochrome P450 monooxygenase BM-3 from
Bacillus megaterium having an amino acid sequence according to
SEQ ID NO:2, which has at least one functional mutation, i.e. a
mutation promoting the oxidation of novel organic substrates
(cf. in particular the groups a) to d) of compounds as
defined below), for example N-heterocyclic mono-, bi- or
polynuclear aromatic compounds, in one of the amino acid sequence
regions 172-224 (F/G loop region), 39-43 (β-strand 1), 48-52
(β-strand 2), 67-70 (β-strand 3), 330-335 (β-strand 5), 352-356
(β-strand 8), 73-82 (helix 5) and 86-88 (helix 6).
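The sequence regions listed above can be kept in a small lookup table to check which region a given residue position falls into. The following Python sketch is an illustration only; the names `REGIONS` and `region_of` are hypothetical, and the position ranges are taken from the paragraph above.

```python
from typing import Dict, Optional, Tuple

# Inclusive residue ranges, as listed in the text for P450 BM-3.
REGIONS: Dict[str, Tuple[int, int]] = {
    "F/G loop region": (172, 224),
    "beta-strand 1": (39, 43),
    "beta-strand 2": (48, 52),
    "beta-strand 3": (67, 70),
    "beta-strand 5": (330, 335),
    "beta-strand 8": (352, 356),
    "helix 5": (73, 82),
    "helix 6": (86, 88),
}


def region_of(position: int) -> Optional[str]:
    """Return the name of the listed region containing the position, if any."""
    for name, (first, last) in REGIONS.items():
        if first <= position <= last:
            return name
    return None


for pos in (74, 87, 188):
    print(pos, region_of(pos))
```

Positions 74, 87 and 188 fall into helix 5, helix 6 and the F/G loop region, respectively; a position outside all listed ranges yields `None`.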

The cytochrome P450 monooxygenase mutants provided according to
the invention are preferably capable of at least one of the
following reactions:

a) oxidation of unsubstituted or substituted N-, O- or
S-heterocyclic mono-, bi- or polynuclear aromatic compounds;

b) oxidation of unsubstituted or substituted mono- or
polynuclear aromatics;

c) oxidation of straight-chain or branched alkanes and alkenes;
and

d) oxidation of unsubstituted or substituted cycloalkanes and
cycloalkenes.



Preferred monooxygenase mutants have at least one functional 
mutation, in particular amino acid substitution, in at least one 
of the sequence regions 73-82, 86-88 and 172-224. Thus, for 
example, Phe87 can be replaced by an amino acid having an 
aliphatic side chain, such as Ala, Val, Leu, in particular Val;
Leu188 can be replaced by an amino acid having an amide side
chain, such as Asn or, in particular, Gln; and Ala74 can be
replaced by another amino acid having an aliphatic side chain,
such as Val and, in particular, Gly.

Particularly preferred monooxygenase mutants of this type are
those which have at least one of the following mono- or polyamino
acid substitutions:

1) Phe87Val;

2) Phe87Val, Leu188Gln; or

3) Phe87Val, Leu188Gln, Ala74Gly;

and functional equivalents thereof. The number indicates the
position of the mutation; the original amino acid is indicated
before the number and the newly introduced amino acid after the
number.
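The substitution notation described above (original amino acid, position, new amino acid) is easy to read mechanically. The following is a minimal Python sketch of such a reader; the function names are hypothetical and the parser accepts only three-letter codes of the 20 standard amino acids.

```python
import re
from typing import List, Tuple

# The 20 standard amino acids in three-letter code.
AMINO_ACIDS = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
}

# <original residue><position><new residue>, e.g. Phe87Val
_LABEL = re.compile(r"([A-Z][a-z]{2})(\d+)([A-Z][a-z]{2})")


def parse_substitution(label: str) -> Tuple[str, int, str]:
    """Split a label such as 'Phe87Val' into (original, position, new)."""
    match = _LABEL.fullmatch(label.strip())
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    original, position, new = match.group(1), int(match.group(2)), match.group(3)
    if original not in AMINO_ACIDS or new not in AMINO_ACIDS:
        raise ValueError(f"unknown amino acid in {label!r}")
    return original, position, new


def parse_mutant(text: str) -> List[Tuple[str, int, str]]:
    """Parse a comma-separated list of substitutions describing one mutant."""
    return [parse_substitution(part) for part in text.split(",")]


print(parse_mutant("Phe87Val, Leu188Gln, Ala74Gly"))
```

For the triple mutant this gives the three tuples (Phe, 87, Val), (Leu, 188, Gln) and (Ala, 74, Gly); malformed labels are rejected with a `ValueError`.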

In this context, "functional equivalents" or analogs of the 
mutants which are disclosed specifically are mutants differing 
therefrom which furthermore have the desired substrate 
specificity with respect to at least one of the oxidation
reactions a) to d) described above, i.e., for example, for 
heterocyclic aromatics and which hydroxylate, for example, 
indole, or furthermore exhibit the desired "modified substrate 
profile" with respect to the wild-type enzyme. 

"Functional equivalents" are also to be understood as meaning in
accordance with the invention mutants which exhibit, in at least
one of the abovementioned sequence positions, an amino acid
substitution other than the one mentioned specifically, but which
still, like the mutant which has been mentioned specifically,
show a "modified substrate profile" with respect to
the wild-type enzyme and catalyze at least one of the
abovementioned oxidation reactions. Functional equivalence exists
in particular also in the case where the modifications in the
substrate profile correspond qualitatively, i.e. where, for
example, the same substrates are converted, but at different
rates.

"Functional equivalents" naturally also encompass P450 
monooxygenase mutants which, like the P450 BM-3 mutants which have
been mentioned specifically, can be obtained by mutating P450
enzymes from other organisms. For example, homologous
sequence regions can be identified by sequence comparison.
Following the principles of what has been set out specifically in
the invention, the modern methods of molecular modeling then
allow equivalent mutations to be carried out which affect the
reaction pattern.

"Functional equivalents" also encompass the mutants which can be 
obtained by one or more additional amino acid additions,
substitutions, deletions and/or inversions, it being possible for
the abovementioned additional modifications to occur in any
sequence position as long as they give rise to a mutant with a
modified substrate profile in the above sense.

Substrates of group a) which can be oxidized according to the
invention are unsubstituted or substituted heterocyclic mono-,
bi- or polynuclear aromatic compounds; in particular oxidizable
or hydroxylatable N-, O- or S-heterocyclic mono-, bi- or
polynuclear aromatic compounds. They include preferably two or
three, in particular two, 4- to 7-membered, in particular 6- or
5-membered, fused rings, where at least one, preferably all,
rings have aromatic character and where at least one of the
aromatic rings carries one to three, preferably one, N-, O- or
S-heteroatom in the ring. The total ring structure may contain
one or two further identical or different heteroatoms. The
aromatic compounds may furthermore carry 1 to 5 substituents at
the ring carbon or heteroatoms. Examples of suitable substituents
are C1- to C4-alkyl, such as methyl, ethyl, n- or isopropyl, n-,
iso- or t-butyl, or C2- to C4-alkenyl, such as ethenyl,
1-propenyl, 2-propenyl, 1-butenyl, 2-butenyl or 3-butenyl,
hydroxyl and halogen, such as F, Cl and Br. The alkyl or alkenyl
substituents mentioned may also have a keto or aldehyde group;
examples being propan-2-on-3-yl, butan-2-on-4-yl,
3-buten-2-on-4-yl. Non-limiting examples of suitable heterocyclic
substrates are, in particular, binuclear heterocycles, such as
indole, N-methylindole, and the substituted analogs thereof
which carry one to three of the above-defined substituents on
carbon atoms, for example 5-chloro- or 5-bromoindole; and also
quinoline and quinoline derivatives, for example
8-methylquinoline, 6-methylquinoline and quinaldine; and
benzothiophene, and the substituted analogs thereof which carry
one to three of the above-defined substituents on carbon atoms.
Moreover, trinuclear heteroaromatics, such as acridine and the
substituted analogs thereof which carry one to three of the
above-defined substituents on carbon atoms, may be mentioned.

Substrates of group b) which are oxidizable according to the
invention are unsubstituted or substituted mono- or polynuclear,
in particular mono- or binuclear, aromatics, such as benzene and
naphthalene. The aromatic compounds may be unsubstituted or mono-
or polysubstituted and, for example, carry 1 to 5 substituents on



0050/50915 CA 02380196 2002-01-23

the ring carbon atoms. Examples of suitable substituents are C1-
to C4-alkyl, such as methyl, ethyl, n- or isopropyl or n-, iso- or
t-butyl, or C2- to C4-alkenyl, such as ethenyl, 1-propenyl,
2-propenyl, 1-butenyl, 2-butenyl or 3-butenyl, hydroxyl and
halogen, such as F, Cl and Br. The alkyl or alkenyl substituents
mentioned may also have a keto or aldehyde group; examples being
propan-2-on-3-yl, butan-2-on-4-yl, 3-buten-2-on-4-yl. The
aromatic may be fused with a four- to seven-membered non-aromatic
ring. The non-aromatic ring may have one or two C=C double bonds,
be mono- or polysubstituted by the abovementioned substituents
and may carry one or two hetero ring atoms. Examples of
particularly suitable aromatics are mononuclear aromatics, such
as cumene, and binuclear substrates, such as indene and
naphthalene, and substituted analogs thereof which carry one to
three of the above-defined substituents on carbon atoms.

Substrates of group c) which can be oxidized according to the
invention are straight-chain or branched alkanes or alkenes
having 4 to 15, preferably 6 to 12, carbon atoms. Examples which
may be mentioned are n-butane, n-pentane, n-hexane, n-heptane,
n-octane, n-nonane, n-decane, n-undecane and n-dodecane, and the
analogs of these compounds which are branched once or more than
once, for example analogous compounds having 1 to 3 methyl side
groups; or mono- or polyunsaturated, for example
monounsaturated, analogs of the abovementioned alkanes.

Substrates of group d) which can be oxidized according to the
invention are unsubstituted or substituted cycloalkanes and
cycloalkenes having 4 to 8 ring carbon atoms. Examples of these
are cyclopentane, cyclopentene, cyclohexane, cyclohexene,
cycloheptane and cycloheptene. The ring structure may carry one
or more, for example 1 to 5, substituents according to the above
definition for compounds of groups a) and b). Nonlimiting
examples are ionones, such as α-, β- and γ-ionone, and the
corresponding methyl ionones and iso-methyl ionones. Particular
preference is given to α- and β-ionone.

The invention also relates to nucleic acid sequences coding for
one of the monooxygenases according to the invention. Preferred
nucleic acid sequences are derived from SEQ ID NO:1 and have
at least one nucleotide substitution which leads to one of the
functional amino acid mutations described above. The invention
moreover relates to functional analogs of the nucleic acids
obtained by addition, substitution, insertion and/or deletion of
individual or multiple nucleotides, which furthermore code for a
monooxygenase having the desired substrate specificity, for
example having indole-oxidizing activity.

The invention also encompasses those nucleic acid sequences which
comprise so-called silent mutations or which are modified in
comparison with a specifically mentioned sequence in accordance
with the codon usage of a specific origin or host organism, and
naturally occurring variants of such nucleic acid sequences. The
invention also encompasses modifications of the nucleic acid
sequences obtained by degeneration of the genetic code (i.e.
without any changes in the corresponding amino acid sequence) or
conservative nucleotide substitution (i.e. the corresponding
amino acid is replaced by another amino acid of the same charge,
size, polarity and/or solubility), and sequences modified by
nucleotide addition, insertion, inversion or deletion, which
sequences encode a monooxygenase according to the invention
having a "modified substrate profile", and the corresponding
complementary sequences.
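The difference between a silent mutation and an amino acid substitution can be shown with a short Python sketch. The codon table below is a deliberately partial excerpt of the standard genetic code, restricted to the residues named in the preferred mutants; the function names are hypothetical.

```python
from typing import Dict

# Partial excerpt of the standard genetic code (DNA codons), covering only
# Phe, Leu, Val, Ala, Gly and Gln. Any other codon raises a KeyError.
CODON_TABLE: Dict[str, str] = {
    "TTT": "Phe", "TTC": "Phe",
    "TTA": "Leu", "TTG": "Leu", "CTT": "Leu", "CTC": "Leu", "CTA": "Leu", "CTG": "Leu",
    "GTT": "Val", "GTC": "Val", "GTA": "Val", "GTG": "Val",
    "GCT": "Ala", "GCC": "Ala", "GCA": "Ala", "GCG": "Ala",
    "GGT": "Gly", "GGC": "Gly", "GGA": "Gly", "GGG": "Gly",
    "CAA": "Gln", "CAG": "Gln",
}


def translate_codon(codon: str) -> str:
    """Translate one DNA codon using the partial table above."""
    return CODON_TABLE[codon.upper()]


def is_silent(codon_before: str, codon_after: str) -> bool:
    """True if the codon change leaves the encoded amino acid unchanged."""
    return translate_codon(codon_before) == translate_codon(codon_after)


print(is_silent("GTT", "GTG"))  # both codons encode Val
print(is_silent("TTT", "GTT"))  # Phe codon changed to a Val codon
```

A change from GTT to GTG is silent because both codons encode Val, whereas a change from TTT to GTT replaces Phe with Val and is therefore an amino acid substitution.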

The invention furthermore relates to expression constructs
comprising a nucleic acid sequence encoding a mutant according to
the invention under the genetic control of regulatory nucleic 
acid sequences; and vectors comprising at least one of these 
expression constructs. 



Preferably, the constructs according to the invention encompass a
promoter 5'-upstream of the encoding sequence in question and a
terminator sequence 3'-downstream, and, optionally, further
customary regulatory elements, in each case operatively
linked with the encoding sequence. Operative linkage is to be
understood as meaning the sequential arrangement of promoter,
encoding sequence, terminator and, if appropriate, other
regulatory elements in such a manner that each of the regulatory
elements can fulfill its intended function on expression of the
encoding sequence. Examples of operatively linkable sequences are
targeting sequences, or else translation enhancers, enhancers,
polyadenylation signals and the like. Further regulatory elements
encompass selectable markers, amplification signals, replication
origins and the like.



In addition to the artificial regulatory sequences, the natural
regulatory sequence can still be present upstream of the actual
structural gene. If desired, this natural regulation may be
switched off by genetic modification, and the expression of the
genes may be enhanced or lowered. However, the gene construct may
also be simpler in construction, i.e. no additional regulatory
signals are inserted upstream of the structural gene and the
natural promoter with its regulation is not removed. Instead, the
natural regulatory sequence is mutated in such a way that
regulation no longer takes place and the gene expression is
increased or reduced. One or more copies of the nucleic acid
sequences may be present in the gene construct.

Examples of suitable promoters are: cos, tac, trp, tet, trp-tet,
lpp, lac, lpp-lac, lacIq, T7, T5, T3, gal, trc, ara, SP6, λ-PR or
λ-PL promoter, which are advantageously employed in Gram-negative
bacteria; and Gram-positive promoters amy and SPO2, the yeast
promoters ADC1, MFα, AC, P-60, CYC1, GAPDH or the plant promoters
CaMV/35S, SSU, OCS, lib4, usp, STLS1, B33, nos or the ubiquitin
or phaseolin promoter. Particular preference is given to using
inducible promoters, for example light- and in particular
temperature-inducible promoters, such as the PrPl promoter.

In principle, all natural promoters with their regulatory
sequences can be used. In addition, synthetic promoters may also
be used in an advantageous fashion.

The abovementioned regulatory sequences are intended to allow the
targeted expression of the nucleic acid sequences and of protein
expression. Depending on the host organism, this may mean, for
example, that the gene is expressed or overexpressed only after
induction has taken place, or that it is expressed and/or
overexpressed immediately.



The regulatory sequences or factors can preferably have a
positive effect on expression and in this manner increase or
reduce the latter. Thus, an enhancement of the regulatory
elements may advantageously take place at the transcriptional
level by using strong transcription signals such as promoters
and/or "enhancers". In addition, translation may also be enhanced
by improving, for example, mRNA stability.

An expression cassette is generated by fusing a suitable promoter
with a suitable monooxygenase nucleotide sequence and a
terminator signal or polyadenylation signal. To this end,
customary recombination and cloning techniques are used as they
are described, for example, in T. Maniatis, E.F. Fritsch and
J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory, Cold Spring Harbor, NY (1989) and in
T.J. Silhavy, M.L. Berman and L.W. Enquist, Experiments with Gene
Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
(1984) and in Ausubel, F.M. et al., Current Protocols in
Molecular Biology, Greene Publishing Assoc. and Wiley
Interscience (1987).

For expression in a suitable host organism, the recombinant
nucleic acid construct or gene construct is advantageously
inserted into a host-specific vector which allows optimal gene
expression in the host. Vectors are well known to the skilled
worker and can be found, for example, in "Cloning Vectors"
(Pouwels P.H. et al., Ed., Elsevier, Amsterdam-New York-Oxford,
1985). Vectors are to be understood as meaning not only plasmids,
but all other vectors known to the skilled worker such as, for
example, phages, viruses, such as SV40, CMV, baculovirus and
adenovirus, transposons, IS elements, phasmids, cosmids, and
linear or circular DNA. These vectors can be replicated
autonomously in the host organism or chromosomally.

The vectors according to the invention allow the generation of
recombinant microorganisms which are transformed, for example,
with at least one vector according to the invention and which can
be employed for producing the mutants. The above-described
recombinant constructs according to the invention are
advantageously introduced into a suitable host system and
expressed. It is preferred to use usual cloning and transfection
methods known to the skilled worker in order to bring about
expression of the abovementioned nucleic acids in the expression
system in question. Suitable systems are described, for example,
in Current Protocols in Molecular Biology, F. Ausubel et al.,
Ed., Wiley Interscience, New York 1997.

Suitable host organisms are, in principle, all organisms which
allow expression of the nucleic acids according to the invention,
their allelic variants, and their functional equivalents or
derivatives. Host organisms are to be understood as meaning, for
example, bacteria, fungi, yeasts or plant or animal cells.
Preferred organisms are bacteria such as those of the genera
Escherichia, such as, for example, Escherichia coli,
Streptomyces, Bacillus or Pseudomonas, eukaryotic microorganisms
such as Saccharomyces cerevisiae, Aspergillus, and higher
eukaryotic cells from animals or plants, for example Sf9 or CHO
cells.

If desired, expression of the gene product may also be brought
about in transgenic organisms such as transgenic animals, in
particular mice or sheep, or transgenic plants. The transgenic
organisms may also be knock-out animals or plants in which the
corresponding endogenous gene has been eliminated, for example
by mutation or partial or complete deletion.

Successfully transformed organisms can be selected by marker
genes which are likewise contained in the vector or in the
expression cassette. Examples of such marker genes are genes for
resistance to antibiotics and for enzymes which catalyze a color
reaction which causes staining of the transformed cell. These
transformed cells can then be selected using automatic cell
selection. Microorganisms which have been transformed
successfully with a vector and which carry an appropriate gene
for resistance to antibiotics (for example G418 or hygromycin)
can be selected by using appropriate antibiotics-containing media
or substrates. Marker proteins which are presented on the cell
surface can be used for selection by affinity chromatography.

The combination of the host organisms and the vectors appropriate
for the organisms, such as plasmids, viruses or phages, such as,
for example, plasmids with the RNA polymerase/promoter system,
phages λ, μ or other temperate phages or transposons and/or other
advantageous regulatory sequences, forms an expression system. The
term "expression system" means, for example, a combination of
mammalian cells such as CHO cells, and vectors, such as pcDNA3neo
vector, which are suitable for mammalian cells.

As described above, the gene product can also be expressed 
advantageously in transgenic animals, for example mice, sheep, or 
transgenic plants. It is likewise possible to program cell-free 
translation systems with the RNA derived from the nucleic acid. 

The invention furthermore provides a process for preparing a
monooxygenase according to the invention, which comprises
cultivating a monooxygenase-producing microorganism, if
appropriate inducing the expression of the monooxygenase, and
isolating the monooxygenase from the culture. If desired, the
monooxygenase according to the invention can thus also be
produced on an industrial scale.

The microorganism can be cultivated and fermented by known
methods. Bacteria, for example, can be grown in a TB or LB medium
and at 20-40°C and a pH of 6-9. Suitable cultivation conditions
are described in detail in T. Maniatis, E.F. Fritsch and
J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory, Cold Spring Harbor, NY (1989), for example.
If the monooxygenase is not secreted into the culture medium, the
cells are then lysed and the monooxygenase is obtained from the
lysate using known methods for the isolation of proteins. The
cells can be lysed alternatively by high-frequency ultrasound, by
high pressure, for example in a French pressure cell, by
osmolysis, by the action of detergents, lytic enzymes or organic
solvents, by homogenization or by a combination of a plurality of
the processes mentioned. Purification of the monooxygenase can be
achieved by known chromatographic processes, such as molecular
sieve chromatography (gel filtration), such as Q-Sepharose
chromatography, ion-exchange chromatography and hydrophobic
chromatography, and by other customary processes, such as
ultrafiltration, crystallization, salting out, dialysis and
native gel electrophoresis. Suitable processes are described, for
example, in Cooper, F.G., Biochemische Arbeitsmethoden
[Biochemical Procedures], Verlag Walter de Gruyter, Berlin, New
York or in Scopes, R., Protein Purification, Springer Verlag, New
York, Heidelberg, Berlin.

To isolate the recombinant protein, it is particularly
advantageous to use vector systems or oligonucleotides which
extend the cDNA by certain nucleotide sequences and thus code for
modified polypeptides or fusion proteins which serve to simplify
purification. Suitable modifications of this type are, for
example, so-called "tags" which act as anchors, such as, for
example, the modification known as hexa-histidine anchor, or
epitopes which can be recognized as antigens by antibodies
(described, for example, in Harlow, E. and Lane, D., 1988,
Antibodies: A Laboratory Manual. Cold Spring Harbor (N.Y.)
Press). These anchors can be used to attach the proteins to a
solid support such as, for example, a polymer matrix, which can,
for example, be packed into a chromatography column, or to a
microtiter plate or to another support.

These anchors can also at the same time be used to recognize the
proteins. It is also possible to use for recognition of the
proteins conventional markers such as fluorescent dyes, enzyme
markers which form a detectable reaction product after reaction
with a substrate, or radioactive markers, alone or in combination
with the anchors for derivatizing the proteins.

The invention moreover relates to a process for the
microbiological oxidation of organic compounds, for example
N-heterocyclic mono-, bi- or polynuclear aromatic compounds
according to the above definition, which comprises

a1) culturing a recombinant microorganism according to the above
    definition in a culture medium, in the presence of an
    exogenous (added) substrate or an intermediately formed
    substrate, which substrate is oxidizable by the monooxygenase
    according to the invention, preferably in the presence of
    oxygen (i.e. aerobically); or

a2) incubating a substrate-containing reaction medium with an
    enzyme according to the invention, preferably in the presence
    of oxygen and an electron donor; and

b)  isolating the oxidation product formed or a secondary product
    thereof from the medium.

The oxygen required for the reaction either passes from the
atmosphere into the reaction medium or, if required, can be added
in a manner known per se.

The oxidizable substrate is preferably selected from

a) unsubstituted or substituted N-heterocyclic mono-, bi- or
   polynuclear aromatic compounds;

b) unsubstituted or substituted mono- or polynuclear aromatics;

c) straight-chain or branched alkanes and alkenes;

d) unsubstituted or substituted cycloalkanes and cycloalkenes.

A preferred process variant is directed to the formation of
indigo/indirubin and is characterized by the fact that the
substrate is indole formed as an intermediate in the culture and
that the indigo and/or indirubin formed in the culture medium by
oxidation of hydroxyindole intermediates is isolated.

If the oxidation according to the invention is carried out using
a recombinant microorganism, the culturing of the microorganisms
is preferably first carried out in the presence of oxygen and in
a complex medium, such as, for example, TB or LB medium at a
culturing temperature of approximately 20 to 40°C and a pH of
approximately 6 to 9, until an adequate cell density is reached.
The addition of exogenous indole is usually not necessary, as
this is intermediately formed by the microorganism. However, when
using other substrates, addition of exogenous substrate may be
required. In order to be able to control the oxidation reaction
better, the use of an inducible, in particular
temperature-inducible, promoter is preferred. The temperature is
in this case increased to the necessary induction temperature,
e.g. 42°C in the case of the PrPl promoter, and is maintained
there for a sufficient period of time, e.g. 1 to 10 or 5 to 6
hours, for the expression of the monooxygenase activity; the
temperature is then reduced
again to a value of approximately 30 to 40°C. The culturing is
then continued in the presence of oxygen for 12 hours to 3 days.
The pH can, in particular in the case of indole oxidation, be
increased by addition of NaOH, e.g. to 9 to 10, whereby the
indigo formation or indirubin formation is additionally promoted
by atmospheric oxidation of the enzymatically formed oxidation
products 2- and 3-hydroxyindole.



The indigo/indirubin formation according to the invention is
illustrated by the reaction scheme below:

[Reaction scheme: indole is oxidized by the P450 BM-3 mutant to
the hydroxyindoles I and II; air oxidation and dimerization lead
to indigo and indirubin]

I: 2-hydroxyindole (oxindole)
II: 3-hydroxyindole (indoxyl)


However, if the oxidation according to the invention is carried
out using purified or enriched enzyme mutants, the enzyme
according to the invention is dissolved in an exogenous
substrate-containing, for example indole-containing, medium
(approximately 0.01 to 10 mM, or 0.05 to 5 mM), and the reaction
is carried out, preferably in the presence of oxygen, at a
temperature of approximately 10 to 50°C, such as, for example, 30
to 40°C, and a pH of approximately 6 to 9 (such as established,
for example, using 100 to 200 mM phosphate or Tris buffer), and
in the presence of a reductant, the substrate-containing medium
moreover containing, relative to the substrate to be oxidized, an
approximately 1- to 100-fold or 10- to 100-fold molar excess of
reduction equivalents. The preferred reductant is NADPH. If
required, the reducing agent can be added in portions.

In a similar manner, the oxidizable substrates which are
preferably used are: n-hexane, n-octane, n-decane, n-dodecane,
cumene, 1-methylindole, 5-Cl- or Br-indole, indene,
benzothiophene, α-, β- and γ-ionone, acridine, naphthalene,
6-methyl- or 8-methylquinoline, quinoline and quinaldine.

The enzymatic oxidation reaction according to the invention can
be carried out, for example, under the following conditions:

Substrate concentration: from 0.01 to 20 mM
Enzyme concentration:    from 0.1 to 10 mg/ml
Reaction temperature:    from 10 to 50°C
pH:                      from 6 to 8
Buffer:                  from 0.05 to 0.2 M potassium phosphate,
                         or Tris/HCl
Electron donor:          is preferably added in portions
                         (initial concentration about 0.1 to
                         2 mg/ml)

The mixture can be preincubated briefly (from 1 to 5 minutes, at
about 20-40°C) before the reaction is initiated, for example by
adding the electron donor (e.g. NADPH). The reaction is carried
out aerobically, if appropriate with additional introduction of
oxygen.



In the substrate oxidation process according to the invention,
oxygen which is present in or added to the reaction medium is
cleaved reductively by the enzyme. The required reduction
equivalents are provided by the added reducing agent (electron
donor).

The oxidation product formed can then be separated off from the
medium and purified in a conventional manner, such as, for
example, by extraction or chromatography.

Further subjects of the invention relate to bioreactors
comprising an enzyme according to the invention or a recombinant
microorganism according to the invention in immobilized form.

A last subject of the invention relates to the use of a
cytochrome P450 monooxygenase according to the invention or of a
vector or microorganism according to the invention for the
microbiological oxidation of a substrate from one of the groups
a) to d), in particular of N-heterocyclic mono-, bi- or
polynuclear aromatic compounds, and preferably for the formation
of indigo and/or indirubin.

The present invention is now described in greater detail with
reference to the following examples.

Example 1:

Randomization of specific codons of P450 BM-3

The experiments were carried out essentially as described in
(19). Three positions (Phe87, Leu188 and Ala74) were randomized
with the aid of site-specific mutagenesis using the Stratagene
QuikChange kit (La Jolla, CA, USA). The following PCR primers
were used for the individual positions:

Phe87:  5'-gcaggagacgggttgnnnacaagctggacg-3' (SEQ ID NO: 3),
        5'-cgtccagcttgtnnncaacccgtctcctgc-3' (SEQ ID NO: 4);
Leu188: 5'-gaagcaatgaacaagnnncagcgagcaaatccag-3' (SEQ ID NO: 5),
        5'-ctggatttgctcgctgnnncttgttcattgcttc-3' (SEQ ID NO: 6);
Ala74:  5'-gctttgataaaaacttaaagtcaannncttaaatttgtacg-3' (SEQ ID
        NO: 7),
        5'-cgtacaaatttaagnnnttgacttaagtttttatcaaagc-3' (SEQ ID
        NO: 8)
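
The primer pairs listed above each carry a fully degenerate "nnn"
codon at the target position. As a minimal sketch (not part of
the described procedure), the following Python snippet checks
that the Phe87 pair as printed is mutually reverse-complementary
and counts the codons a single "nnn" can encode. The function
name reverse_complement and the variable names are illustrative
assumptions.

```python
# Complement table for the bases that occur in the primers;
# the degenerate base "n" (any base) pairs with "n".
COMPLEMENT = str.maketrans("acgtn", "tgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lower-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Phe87 primer pair as printed in the text (SEQ ID NO: 3 and 4).
phe87_forward = "gcaggagacgggttgnnnacaagctggacg"
phe87_reverse = "cgtccagcttgtnnncaacccgtctcctgc"

# Check that the two primers anneal to the same site on opposite strands.
pair_is_complementary = reverse_complement(phe87_forward) == phe87_reverse

# Three degenerate positions with four bases each: 4**3 possible codons.
codons_per_nnn = 4 ** len("nnn")

print(pair_is_complementary, codons_per_nnn)
```

Because "nnn" covers all 64 codons, every amino acid (and the
stop codons) can in principle appear at the randomized position.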

The conditions for the PCR were identical for all three
positions. In particular, 17.5 pmol of each primer, 20 pmol of
template plasmid DNA, 3 U of Pfu polymerase and 3.25 nmol of each
dNTP were used per 50 µl reaction volume. The PCR reaction was
started at 94°C/1 min and the following temperature cycle was
then carried out 20 times: 94°C, 1 min; 46°C, 2.5 min; 72°C,
17 min. After 20 cycles, the reaction was continued at 72°C for
15 min. After the PCR, the template DNA was digested at 37°C for
3 h using 20 U of DpnI. E. coli DH5α was then transformed. The
transformed E. coli DH5α cells were plated out onto LB agar
plates which contained 150 µg/ml of ampicillin. Incubation was
then carried out at 37°C for 18 h.
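
To give a feel for the length of the thermal program described
above, the hold times can simply be added up. The sketch below
does only that; ramping between temperatures is ignored, and the
constant and function names are illustrative assumptions.

```python
# Thermal program from the PCR description; durations in minutes.
INITIAL_DENATURATION_MIN = 1.0  # 94°C, 1 min
CYCLE_STEPS_MIN = {"94°C": 1.0, "46°C": 2.5, "72°C": 17.0}
CYCLE_COUNT = 20
FINAL_EXTENSION_MIN = 15.0  # 72°C, 15 min


def total_hold_time_min() -> float:
    """Sum of all programmed hold times (ramp times are not included)."""
    per_cycle = sum(CYCLE_STEPS_MIN.values())
    return INITIAL_DENATURATION_MIN + CYCLE_COUNT * per_cycle + FINAL_EXTENSION_MIN


hold_time_min = total_hold_time_min()
print(f"{hold_time_min:.0f} min = {hold_time_min / 60:.1f} h")
```

The long 72°C step dominates the run time, which is expected when
a whole plasmid is copied in each cycle.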

Example 2:

Expression and purification of the P450 BM-3 and its mutants and
production of a blue pigment

The P450 BM-3 gene and the mutants thereof were expressed under
the control of the strong, temperature-inducible PRPL promoter of
the plasmid pCYTEXP1 in E. coli DH5α as already described (20).
Colonies were picked using sterile toothpicks and transferred to
96-well microtiter plates containing 200 µl of TB medium and
100 µg/ml of ampicillin per well. Incubation was then carried out
at 37°C overnight. 40 µl of the cell culture from each well were
then transferred to a culture tube which contained 2 ml of TB
medium with 100 µg/ml of ampicillin. Culturing was then carried
out at 37°C for 2 h. The temperature was then increased to 42°C
for 6 h for induction. Culturing was then continued at 37°C
overnight, a blue pigment being produced.

The preparative production of enzyme or blue pigment was carried
out starting from a 300 ml cell culture (OD578nm = 0.8 to 1.0).
For the isolation of the enzyme, the cells were centrifuged off
at 4000 rpm for 10 min and resuspended in 0.1 M KxPO4 buffer, pH
7.4. The ice-cooled cells were carefully disrupted with the aid
of a Branson Sonifier W25 (Dietzenbach, Germany) at an energy
output of 80 W by sonication for 2 min three times. The
suspensions were centrifuged at 32570 x g for 20 min. The crude
extract was employed for the activity determination or for the
enzyme purification. The enzyme purification was carried out as
already described in (21), to which reference is expressly made
hereby. The concentration of purified enzyme was determined by
means of the extinction difference at 450 and 490 nm, as already
described in (11), using an extinction coefficient ε of
91 mM⁻¹ cm⁻¹.
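
The concentration determination described above is an application
of the Beer-Lambert law to the extinction difference between 450
and 490 nm. A minimal sketch follows; the absorbance readings,
the 1 cm path length and the function name are assumptions made
only to illustrate the arithmetic.

```python
EPSILON_P450_MM_CM = 91.0  # mM^-1 cm^-1, value given in the text


def p450_concentration_um(a450: float, a490: float, path_cm: float = 1.0) -> float:
    """Enzyme concentration in µM from the extinction difference A450 - A490."""
    concentration_mm = (a450 - a490) / (EPSILON_P450_MM_CM * path_cm)
    return concentration_mm * 1000.0  # convert mM to µM


# Hypothetical readings, not measured values from the document.
example_um = p450_concentration_um(a450=0.200, a490=0.018)
print(f"{example_um:.2f} µM")
```

Any dilution made before the measurement would have to be
multiplied back in separately.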

Example 3:

Isolation of mutants which produce large amounts of blue pigment

For each position, 100 colonies were isolated from the mutants
which were produced by randomized mutagenesis of the codon of the
corresponding position. These colonies were cultured in culture
tubes for the production of blue pigment. After washing the cells
with water and a number of slow centrifugation steps (500 rpm),
the blue pigment was extracted using dimethyl sulfoxide (DMSO).
The solubility of the blue pigment was greatest in DMSO. The
absorption of the extract was determined at 677 nm. Of the
mutants from a specific position, the mutant which produced the
largest amount of blue pigment was used for DNA sequencing (ABI
DNA sequencing kit; ABI Prism™ 377 DNA sequencer) and moreover as
a template for site-specific randomized mutagenesis.

Example 4:

Activity test for the indole hydroxylation

The indole hydroxylation activity was tested in a solution which
contained 8 µl of a 10-500 mM indole solution in DMSO, 850 µl of
tris/HCl buffer (0.1 M, pH 8.2) and 0.6 nmol of P450 BM-3 wild
type or mutant in a final volume of 1 ml. The mixture was
preincubated for 9 min before the reaction was started by
addition of 50 µl of an aqueous 1 mM solution of NADPH. The
reaction was stopped after 20 sec by addition of 60 µl of 1.2 M
KOH. Within 5 to 30 sec (under aerobic conditions), the enzyme
products were converted completely into indigo
([Δ2,2'-biindoline]-3,3'-dione) and indirubin
([Δ2,3'-biindoline]-2',3-dione). The indigo production was
determined by means of its absorption at 670 nm. A calibration
curve using pure indigo showed an extinction coefficient of
3.9 mM⁻¹ cm⁻¹ at this wavelength. A linear curve was obtained for
indigo production in a reaction time of 40 sec using 0.6 nmol of
wild type or P450 BM-3 mutant and 0.05 to 5.0 mM of indole.
Indirubin shows a very weak absorption at 670 nm and the amount
of indirubin formed was very much smaller than the amount of
indigo formed. The formation of indirubin was therefore neglected
in the determination of the kinetic parameters. The NADPH
consumption was determined at 340 nm and calculated as described
(17) using an extinction coefficient of 6.2 mM⁻¹ cm⁻¹.
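
The two extinction coefficients given above allow the indigo
formed and the NADPH consumed to be put on the same molar scale.
The sketch below is an illustration only, not the calculation of
reference (17): it assumes that both readings refer to the same
volume and a 1 cm path, that each indigo molecule is built from
two indole-derived units, and that indirubin is neglected. The
readings and function names are hypothetical.

```python
EPSILON_INDIGO_670 = 3.9  # mM^-1 cm^-1 at 670 nm (from the text)
EPSILON_NADPH_340 = 6.2   # mM^-1 cm^-1 at 340 nm (from the text)


def concentration_mm(absorbance: float, epsilon: float, path_cm: float = 1.0) -> float:
    """Beer-Lambert law: concentration in mM from an absorbance reading."""
    return absorbance / (epsilon * path_cm)


def indole_per_nadph(a670: float, delta_a340: float, path_cm: float = 1.0) -> float:
    """Indole units found in indigo per NADPH consumed (assumed 2 indole per indigo)."""
    indigo_mm = concentration_mm(a670, EPSILON_INDIGO_670, path_cm)
    nadph_mm = concentration_mm(delta_a340, EPSILON_NADPH_340, path_cm)
    return 2.0 * indigo_mm / nadph_mm


# Hypothetical readings chosen only to illustrate the arithmetic.
ratio = indole_per_nadph(a670=0.039, delta_a340=0.124)
print(f"{ratio:.2f} indole units per NADPH")
```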

Example 5:

Purification of indigo and indirubin

After washing the cells with water and repeated centrifugation at
500 g, the blue pellet formed was extracted using tetrahydrofuran
(THF). The extract was evaporated almost to dryness and the red
pigment was extracted a number of times with 50 ml of absolute
ethanol. The residual blue solid was dissolved in THF and
analyzed by thin-layer chromatography (TLC). The ethanol solution
was evaporated and purified by silica gel chromatography (TLC 60,
Merck, Darmstadt, Germany; 2 cm x 30 cm) before it was washed
with THF and petroleum ether in a ratio of 1:2. The red solution
obtained was evaporated and its purity was determined by TLC. The
absorption spectra of the blue and of the red pigment were
determined in a range from 400 to 800 nm with the aid of an
Ultraspec 3000 spectrophotometer (Pharmacia, Uppsala, Sweden).
The blue and the red pigment were moreover analyzed by mass
spectrometry and 1H-NMR spectroscopy.

Experimental results

1. Increasing the productivity for blue pigment by P450 BM-3
mutagenesis

Native P450 BM-3 does not have the ability to produce the blue
indigo-containing pigment, or the precursor substances 2- or
3-hydroxyindole. In order to be able to prepare a sufficient
amount of blue pigment, P450 BM-3 was subjected to evolution in a
controlled manner. All mutants which produced the blue pigment
were sequenced. It was found that at least one of the following
three positions was mutated: Phe87, Leu188 and Ala74. It was
therefore assumed that these three positions play a crucial role
for the activity of P450 BM-3 in the production of blue pigment.
From the structure of the heme domain of cytochrome P450 BM-3,
complexed with palmitoleic acid, it is seen that Phe87 prevents
the substrate from coming closer to the heme group (14). The
mutant Phe87Val shows a high regio- and stereoselectivity in the
epoxidation of (14S,15R)-arachidonic acid (13) and the mutant
Phe87Ala shifts the hydroxylation position from ω-1, ω-2 and ω-3
to ω (22). Position 87 was therefore selected first for the
site-specific randomized mutagenesis with the aid of PCR. In tube
cultures, 7 colonies were obtained which produced a small amount
of blue pigment after induction. The colony which produced the
largest amount of the blue pigment was selected for DNA
sequencing. The sequence data showed substitution of Phe87 by
Val. The mutant Phe87Val was then used as a template for the
second round of site-specific randomized mutagenesis on position
Leu188. The structure of the heme domain, complexed with
palmitoleic acid, shows that the repositioning of the F and G
helices brings the residue Leu188 into direct contact with the
substrate (14). This position can therefore play an important
role in substrate binding or orientation. After the second
screening passage, 31 colonies were observed which produced the
blue pigment. The mutant which produced the largest amount of
pigment contained the substitutions Phe87Val and Leu188Gln. This
mutant was then mutated in position Ala74 in the third passage of
site-specific randomized mutagenesis. In this case the triple
mutant F87L188A74 (Phe87Val, Leu188Gln and Ala74Gly) was
obtained, which produced several mg of blue pigment in a 2-liter
flask containing 300 ml of TB medium. This amount was sufficient
for the isolation and characterization of the blue pigment.

2. Isolation and identification of the blue pigment

After washing the cells, the residual blue pellet was extracted
with THF and analyzed by TLC. The blue pigment was separated into
a rapidly migrating blue component and a more slowly migrating
red component. Both components showed exactly the same mobility
parameters as the components of a commercial indigo sample.

After the purification, the absorption spectra of both components
were determined in DMSO. The blue component showed the same
spectrum as a commercial indigo sample. The purified blue and red
components were each analyzed by mass spectrometry. The mass
spectra of both pigments showed a strong molecular ion peak at
m/e = 262 and two fragment peaks at m/e = 234 and 205 (relative
intensity in each case 10%). This pattern is typical of indigoid
compounds. The elementary composition of these ions was
determined by high-resolution mass spectrometry as C16H10N2O2,
C15H10N2O and C14H9N2. This is also characteristic of structures
of the indigo type. The blue pigment was thus identified as
indigo and the red pigment as indirubin. For the confirmation of
the structure, 500 MHz 1H-NMR spectra of both pigments were
recorded in DMSO-d6 solution. The results agreed with the
literature data (23).
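
The three elemental compositions reported for the molecular ion
and its fragments (C16H10N2O2, C15H10N2O and C14H9N2) can be
checked against the observed m/e values of 262, 234 and 205 with
nominal (integer) atomic masses. The following is a small
illustrative sketch; the parser handles only simple formulas of
the kind written here, and all names are assumptions.

```python
import re

# Nominal masses of the most abundant isotopes.
NOMINAL_MASS = {"C": 12, "H": 1, "N": 14, "O": 16}


def nominal_mass(formula: str) -> int:
    """Nominal mass of a simple formula such as 'C16H10N2O2'."""
    total = 0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += NOMINAL_MASS[element] * (int(count) if count else 1)
    return total


ion_masses = {f: nominal_mass(f) for f in ("C16H10N2O2", "C15H10N2O", "C14H9N2")}
print(ion_masses)
```

The differences of 28 and 29 mass units correspond to the
formula differences CO and CHO between successive ions.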

3. Production of indigo using isolated enzymes

It is known that indigo is accessible from indole by microbial
transformation (24-26). None of these microbial systems, however,
contained a P450 monooxygenase. According to the invention, the
catalytic activity of the pure enzyme for indole was first
determined. The mutant F87L188A74 was mixed with indole. No color
reaction could be observed. Only after addition of NADPH to the
reaction mixture was the blue pigment formed after approximately
20 min. By adjustment of the pH of the reaction mixture to a
value of approximately 11, 30 sec after addition of NADPH, the
blue coloration was visible within a few seconds. Control
experiments using native P450 BM-3 were always negative, even
using increased concentrations of enzyme, indole and NADPH. The
blue pigment was extracted using ethyl acetate and analyzed by
TLC. The blue pigment again separated into a more rapidly running
blue component and a slower running red component. The Rf values
and the absorption spectra were identical to those of the
extracts from the fermentation broth. The F87L188A74 mutant of
P450 BM-3 is thus an indole hydroxylase.

Two routes have previously been described for the enzymatic
transformation of indole to indigo. One route is catalyzed by a
dioxygenase, the other by a styrene monooxygenase (24, 25). The
NADPH stoichiometry is in both cases 2. It was therefore assumed
that, in contrast to the dioxygenases, the mutant F87L188A74
according to the invention hydroxylates indole in only one
position to form oxindole (2-hydroxyindole) or indoxyl
(3-hydroxyindole).

4. Kinetic parameters of indole hydroxylation

Pure samples of the wild-type enzyme P450 BM-3 and of the mutants
Leu188Gln, Phe87Val, F87L188 and F87L188A74 were used for the
determination of the kinetic parameters of indole hydroxylation.
The results are summarized in Table 1 below.

Table 1: Kinetic parameters of the P450 BM-3 mutants for
indole hydroxylation

Mutants        kcat (s⁻¹)     Km (mM)       kcat/Km (M⁻¹ s⁻¹)
WT             -a)
Leu188Gln      n.d.b)         n.d.          n.d.
Phe87Val       2.03 (0.14)    17.0 (1.0)    119
F87L188        2.28 (0.16)    4.2 (0.4)     543
F87L188A74     2.73 (0.16)    2.0 (0.2)     1365

a) no activity was observed;
b) not determined (activity was too low to be measured)

Even with an excess of purified enzyme and high indole
concentration, the wild-type enzyme is not able to oxidize
indole. The mutant Leu188Gln shows a low activity. The mutant
Phe87Val shows a catalytic efficiency of 119 M⁻¹ s⁻¹ for indole
hydroxylation. The catalytic efficiency of the double mutant
F87L188 (Phe87Val, Leu188Gln) increased to 543 M⁻¹ s⁻¹ and was
increased to 1365 M⁻¹ s⁻¹ by introduction of the further
substitution Ala74Gly. The kcat values increased from Phe87Val to
the triple mutant by a total of 35%, while the Km values
decreased approximately seven-fold. This indicates that Ala74Gly
and Leu188Gln are mainly involved in substrate binding.

For the triple mutant F87L188A74, the indole turnover rate
(kcat = 2.73 s⁻¹) was more than ten times higher than for most
P450 enzymes (18).
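
The catalytic efficiencies quoted above follow directly from the
kcat and Km values of Table 1 (Phe87Val: 2.03 s⁻¹, 17.0 mM;
F87L188: 2.28 s⁻¹, 4.2 mM; F87L188A74: 2.73 s⁻¹, 2.0 mM). The
short sketch below recomputes kcat/Km after converting Km from mM
to M; the dictionary and function names are illustrative
assumptions.

```python
# kcat (s^-1) and Km (mM) as listed in Table 1.
TABLE_1 = {
    "Phe87Val": (2.03, 17.0),
    "F87L188": (2.28, 4.2),
    "F87L188A74": (2.73, 2.0),
}


def catalytic_efficiency(kcat_per_s: float, km_mm: float) -> float:
    """kcat/Km in M^-1 s^-1, with Km converted from mM to M."""
    return kcat_per_s / (km_mm / 1000.0)


efficiencies = {
    name: round(catalytic_efficiency(kcat, km)) for name, (kcat, km) in TABLE_1.items()
}

# Relative increase in kcat from Phe87Val to the triple mutant, in percent.
kcat_gain_percent = 100.0 * (2.73 - 2.03) / 2.03

print(efficiencies, f"{kcat_gain_percent:.1f}%")
```

The gain in efficiency is carried mostly by the falling Km, since
kcat changes by only about a third across the three mutants.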



Example 6

Hydroxylation of n-octane using modified cytochrome P450
monooxygenase

The reactions were carried out using a P450 BM-3 monooxygenase
mutant comprising the following mutations: Phe87Val, Leu188Gln,
Ala74Gly.



The chosen substrate was n-octane. For the hydroxylation of
n-octane, the following aerobic reaction mixture was used:

P450 BM-3 mutant:  17.5 mg (lyophilisate)
Reaction buffer:   9.1 ml (potassium phosphate buffer 50 mM,
                   pH 7.5)
Substrate:         50 µl of a 60 mM solution (in acetone)
Temperature:       25°C



The enzyme lyophilisate was dissolved in 500 µl of reaction buffer
and initially incubated at room temperature with substrate and
reaction buffer for 5 minutes. 300 µl of NADPH solution (5 mg/ml)
were then added. Addition of NADPH was repeated two more times.
The progress of the reaction was monitored by measuring the
absorption at 340 nm, which allows the NADPH decrease to be
observed. NADPH was added in aliquots of 300 µl, since too high a
concentration of NADPH in the reaction solution would result in
inactivation of the enzyme. To isolate the products, the reaction
solution was then extracted three times with 5 ml of diethyl
ether. The combined organic phases were dried over MgSO4 and
concentrated. The products were then characterized by TLC, GC/MS
and NMR.



The GC/MS analysis of the reaction mixture gave the following
result:

Compound     Rt [min]1)    Conversion [%]
4-octanol    13.51         37
3-octanol    14.08         47
2-octanol    14.26         16

1) Temperature program: 40°C 1 min isothermic / 3°C/min 95°C
/ 10°C/min 275°C; apparatus: Finnigan MAT 95; GC: HP 5890 Series II
Split Injector; column: HP-5MS (methylsiloxane) 30 m x 0.25 mm;
carrier gas: 0.065 ml of He/min

No starting material was found.

Example 7:

Hydroxylation of aromatics, heteroaromatics and
trimethylcyclohexenyl compounds

a) Example 6 was repeated, but using, instead of n-octane, the
   substrate naphthalene. The products that were identified were
   1-naphthol and cis-1,2-dihydroxy-1,2-dihydronaphthalene. 88%
   of the naphthalene starting material had been converted.



Analytic methods for reactions with naphthalene

GC:
Apparatus: Carlo Erba Strumentazione type HRGC 4160, on-column
injector; column: DB5 30 m x 0.2 mm; material: 5% diphenyl-/95%
dimethylpolysiloxane; carrier gas: 0.5 bar H2;
temperature program: 40°C 1 min isothermic / 10°C/min to 300°C
Rt (1-naphthol) = 16.68

NMR:
1-Naphthol and cis-1,2-dihydroxy-1,2-dihydronaphthalene were
identified in the 1H NMR.



b) Example 6 was repeated but using, instead of n-octane, the
   substrate 8-methylquinoline. 5-Hydroxy-8-methylquinoline was
   identified as main product, in addition to other derivatives
   (product ratio 5:1). 35% of the starting material used had
   been converted.

c) Example 6 was repeated but using, instead of n-octane, the
   substrate α-ionone. 3-Hydroxy-α-ionone was identified as main
   product, in addition to other derivatives (product ratio:
   76:24). 60% of the starting material used had been converted.

d) Example 6 was repeated, but using, instead of n-octane, the
   substrate cumene (isopropylbenzene). Five monohydroxy
   products and one dihydroxy product were identified. 70% of
   the starting material used had been converted.



SEQUENCE LISTING

<110> BASF Aktiengesellschaft

<120> Novel cytochrome P450 monooxygenases and their use for the
      oxidation of organic substrates

<130> M/40241

<140>
<141>

<160> 9

<170> PatentIn Ver. 2.1

<210> 1
<211> 3150
<212> DNA
<213> Bacillus megaterium

<220>
<221> CDS
<222> (4)..(3150)

<400> 1

atg aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa 48
    Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
    1 5 10 15

aat tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa 96
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30

att gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt 144
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45

gta acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat 192
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60

gaa tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt 240
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75

gat ttt gca gga gac ggg tta ttt aca agc tgg acg cat gaa aaa aat 288
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn
80 85 90 95

tgg aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca 336
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110

atg aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt 384
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125

caa aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa 432
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140

gac atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac 480
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155

tat cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca 528
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
160 165 170 175

agt atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca 576
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190

aat cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa 624
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205

gat atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc 672
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220

aaa gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac 720
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235

gga aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc 768
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
240 245 250 255

tat caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt 816
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270

ctt tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta 864
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu

275 280 285

caa aaa gca gca gaa gaa gcc gca cga gtt cta gta gat cct gtt cca 912
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300

agc tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac 960
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315

gaa gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca 1008
Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala
320 325 330 335

aaa gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac 1056
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350

gaa cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg 1104
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365

gga gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt 1152
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380

gcg att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg 1200
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395

tgt atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt 1248
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
400 405 410 415

atg atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg 1296
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430

gat att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa 1344
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445

gca aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act 1392
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460

gaa cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat 1440
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475

acg ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga 1488
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
480 485 490 495



acg gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg 1536
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510

cag gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga 1584
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525

gct gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac 1632
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540

gca aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta 1680
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555

aaa ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct 1728
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
560 565 570 575

act acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct 1776
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590

aaa ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac 1824
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605

gac ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac 1872
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620

gta gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa 1920
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635

tct act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt 1968
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
640 645 650 655

gcg aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa 2016
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670

ctt caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa 2064
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685

ctt cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att 2112
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700

cct cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc 2160
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715

cta gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta 2208
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
720 725 730 735

gct cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa 2256
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750

tac gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg 2304
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765

gct gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg 2352
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780

ctt gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca 2400
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795

atg ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc 2448
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
800 805 810 815

gaa ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att 2496
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830

tct tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc 2544
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845

gtt gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att 2592
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860

gcg tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc 2640
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875

ttt att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa 2688
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
880 885 890 895

acg ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga 2736
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg

900 905 910

ggc ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt 2784
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925

gga gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat 2832
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940

ctg tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg 2880
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955

ctt cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt 2928
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
960 965 970 975

cag cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat 2976
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990

caa gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct 3024
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005

gcc gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg 3072
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
1010 1015 1020

agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa ggc 3120
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys Gly
1025 1030 1035

cga tac gca aaa gac gtg tgg gct ggg taa 3150
Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045

<210> 2 
<211> 1048 
<212> PRT 

<213> Bacillus megaterium 
<400> 2 

Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
1 5 10 15

Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
20 25 30

Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
35 40 45 
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Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp Glu 
50 55 eo 

Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg Asp 
65 70 75 80 

Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp 

85 90 95 

Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala Met 

100 105 no 

Lys Gly Tyr His Ala Met Met Val Asp He Ala Val Gin Leu Val Gin 
115 120 125 

Lys Trp Glu Arg Leu Asn Ala Asp Glu His He Glu Val Pro Glu Asp 
130 135 140 

Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn Tyr 
145 150 155 i 6 o 

Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe He Thr Ser 

1«5 170 i 7 5 

Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala Asn 

180 185 190 

Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu Asp 
!95 200 205 

He Lys Val Met Asn Asp Leu Val Asp Lys He He Ala Asp Arg Lys 
210 215 220 

Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 
225 230 235 240 

Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arg Tyr 

245 250 255 

Gin He He Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly Leu 

260 265 270 

Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gin 
275 280 285 

Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 
290 295 300 

Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
305 310 315 320

Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
325 330 335

Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
340 345 350

Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
355 360 365

Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
370 375 380

Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
385 390 395 400

Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
405 410 415

Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
420 425 430

Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
435 440 445

Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
450 455 460

Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
465 470 475 480

Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
485 490 495

Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
500 505 510

Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
515 520 525

Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
530 535 540

Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
545 550 555 560

Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
565 570 575



Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
580 585 590

Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
595 600 605

Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
610 615 620

Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
625 630 635 640

Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
645 650 655

Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
660 665 670

Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
675 680 685

Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
690 695 700

Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
705 710 715 720

Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
725 730 735

His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
740 745 750

Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
755 760 765

Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
770 775 780

Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
785 790 795 800

Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
805 810 815

Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
820 825 830

Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
835 840 845



Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
850 855 860

Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
865 870 875 880

Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
885 890 895

Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
900 905 910

Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
915 920 925

Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
930 935 940

Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
945 950 955 960

His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
965 970 975

His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
980 985 990

Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
995 1000 1005

Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val Ser
1010 1015 1020

Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys Gly Arg
1025 1030 1035 1040

Tyr Ala Lys Asp Val Trp Ala Gly
1045

<210> 3 
<211> 30 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 3 

gcaggagacg ggttgnnnac aagctggacg

<210> 4
<211> 30
<212> DNA

<213> Synthetic sequence
<220>

<223> Description of the synthetic sequence: PCR primer
<400> 4

cgtccagctt gtnnncaacc cgtctcctgc

<210> 5
<211> 34
<212> DNA

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 5 

gaagcaatga acaagnnnca gcgagcaaat ccag 

<210> 6 
<211> 30 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 6 

ctggatttgc tcgctgnnnc ttgttcattg 

<210> 7 
<211> 41 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 7 

gctttgataa aaacttaaag tcaannnctt aaatttgtac g 

<210> 8 
<211> 40 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 8 

cgtacaaatt taagnnnttg acttaagttt ttatcaaagc

<210> 9
<211> 1049
<212> PRT

<213> Bacillus megaterium
<400> 9

Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30

Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45

Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80

Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn
85 90 95

Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110

Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125

Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140

Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205

Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220

Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255

Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285

Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300

Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320

Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala
325 330 335

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350

Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380

Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400

Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430

Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445

Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460

Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495



Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510

Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525

Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540

Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575

Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590

Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620

Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640

Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670

Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685

Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700

Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720

Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750

Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765



Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780

Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815

Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830

Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860

Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880

Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895

Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910

Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940

Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975

Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990

Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
1010 1015 1020

Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys Gly
1025 1030 1035 1040



Arg Tyr Ala Lys Asp Val Trp Ala Gly
1045

PCT/EP00/07253

SEQUENCE LISTING

<110> BASF Aktiengesellschaft

<120> Novel cytochrome P450 monooxygenases and their use for the
oxidation of organic substrates

<130> M/40241

<140>
<141>

<160> 9

<170> PatentIn Ver. 2.1

<210> 1
<211> 3150
<212> DNA

<213> Bacillus megaterium

<220>

<221> CDS

<222> (4)..(3150)

<400> 1

atg aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa 48
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15

aat tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa 96
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30

att gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt 144
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45

gta acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat 192
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60

gaa tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt 240
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75

gat ttt gca gga gac ggg tta ttt aca agc tgg acg cat gaa aaa aat 288
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn
80 85 90 95

tgg aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca 336
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110

atg aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt 384
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125

caa aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa 432
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140

gac atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac 480
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155

tat cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca 528
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
160 165 170 175

agt atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca 576
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190

aat cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa 624
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205

gat atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc 672
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220

aaa gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac 720
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235

gga aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc 768
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
240 245 250 255

tat caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt 816
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270

ctt tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta 864
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285

caa aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca 912
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300

agc tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac 960
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315

gaa gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca 1008
Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala
320 325 330 335

aaa gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac 1056
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350

gaa cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg 1104
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365

gga gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt 1152
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380

gcg att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg 1200
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395

tgt atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt 1248
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
400 405 410 415

atg atg cta aaa cac ttt gac ttt gaa gat cat aca ... tac gag ctg 1296
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430

gat att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa 1344
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445

gca aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act 1392
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460

gaa cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat 1440
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475

acg ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga 1488
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
480 485 490 495

acg gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg 1536
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510

cag gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga 1584
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525

gct gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac 1632
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540

gca aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta 1680
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555

aaa ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct 1728
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
560 565 570 575

act acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct 1776
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590

aaa ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac 1824
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605

gac ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac 1872
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620

gta gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa 1920
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635

tct act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt 1968
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
640 645 650 655

gcg aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa 2016
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670

ctt caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa 2064
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685

ctt cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att 2112
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700

cct cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc 2160
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715

cta gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta 2208
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
720 725 730 735

gct cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa 2256
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750

tac gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg 2304
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765

gct gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg 2352
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780

ctt gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca 2400
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795

atg ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc 2448
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
800 805 810 815

gaa ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att 2496
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830

tct tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc 2544
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845

gtt gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att 2592
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860

gcg tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc 2640
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875

ttt att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa 2688
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
880 885 890 895

acg ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga 2736
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910

ggc ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt 2784
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
gga gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat 2832
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940

ctg tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg 2880
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955

ctt cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt 2928
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
960 965 970 975

cag cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat 2976
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990

caa gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct 3024
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005

gcc gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg 3072
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
1010 1015 1020

agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa ggc 3120
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys Gly
1025 1030 1035

cga tac gca aaa gac gtg tgg gct ggg taa 3150
Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045

<210> 2
<211> 1048
<212> PRT

<213> Bacillus megaterium
<400> 2

Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
1 5 10 15

Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
20 25 30

Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
35 40 45

Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
50 55 60

Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
65 70 75 80

Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp
85 90 95

Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
100 105 110

Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
115 120 125

Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
130 135 140

Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
145 150 155 160

Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
165 170 175

Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
180 185 190

Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
195 200 205

Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
210 215 220

Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
225 230 235 240

Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
245 250 255

Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
260 265 270

Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
275 280 285

Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
290 295 300

Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
305 310 315 320

Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
325 330 335

Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
340 345 350

Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
355 360 365

Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
370 375 380

Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
385 390 395 400

Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
405 410 415

Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
420 425 430

Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
435 440 445

Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
450 455 460

Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
465 470 475 480

Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
485 490 495

Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
500 505 510

Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
515 520 525

Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
530 535 540

Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
545 550 555 560

Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
565 570 575

Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
580 585 590

Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
595 600 605

Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
610 615 620



Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
625 630 635 640

Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
645 650 655

Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
660 665 670

Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
675 680 685

Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
690 695 700

Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
705 710 715 720

Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
725 730 735

His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
740 745 750

Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
755 760 765

Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
770 775 780

Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
785 790 795 800

Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
805 810 815

Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
820 825 830

Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
835 840 845

Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
850 855 860

Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
865 870 875 880

Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
885 890 895

Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
900 905 910

Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
915 920 925

Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
930 935 940

Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
945 950 955 960

His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
965 970 975

His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
980 985 990

Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
995 1000 1005

Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val Ser
1010 1015 1020

Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys Gly Arg
1025 1030 1035 1040

Tyr Ala Lys Asp Val Trp Ala Gly
1045

<210> 3
<211> 30
<212> DNA

<213> Synthetic sequence
<220>

<223> Description of the synthetic sequence: PCR primer
<400> 3

gcaggagacg ggttgnnnac aagctggacg 30

<210> 4
<211> 30
<212> DNA

<213> Synthetic sequence
<220>

<223> Description of the synthetic sequence: PCR primer
<400> 4

cgtccagctt gtnnncaacc cgtctcctgc

<210> 5 
<211> 34 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 5 

gaagcaatga acaagnnnca gcgagcaaat ccag

<210> 6
<211> 30
<212> DNA

<213> Synthetic sequence
<220>

<223> Description of the synthetic sequence: PCR primer
<400> 6

ctggatttgc tcgctgnnnc ttgttcattg 

<210> 7 
<211> 41 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 7 

gctttgataa aaacttaaag tcaannnctt aaatttgtac g 

<210> 8 
<211> 40 
<212> DNA 

<213> Synthetic sequence 
<220> 

<223> Description of the synthetic sequence: PCR primer 
<400> 8 

cgtacaaatt taagnnnttg acttaagttt ttatcaaagc

<210> 9
<211> 1049
<212> PRT

<213> Bacillus megaterium
<400> 9

Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30

Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45

Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80

Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn
85 90 95

Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110

Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125

Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140

Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205

Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220

Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255

Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285

Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300

Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320

Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala
325 330 335

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350

Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380

Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400

Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430

Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445

Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460

Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495

Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510

Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525

Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540

Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575

Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590

Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620

Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640

Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670

Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685

Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700

Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720

Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750

Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780



Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800



Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815

Glu Phe Ile Ala Leu Leu Pro
820

Ile Arg Pro Arg Tyr Tyr
825 830



Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860

Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880

Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895

Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910

Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940

Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975

Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990

Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
1010 1015 1020

Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys Gly
1025 1030 1035 1040



Arg Tyr Ala Lys Asp Val Trp Ala Gly
1045

We claim:

1. A cytochrome P450 monooxygenase which is capable of at least
one of the following reactions:

a) oxidation of optionally substituted N-, O- or S-heterocyclic
mono- or polynuclear aromatic compounds;

b) oxidation of optionally substituted mono- or polynuclear
aromatics;

c) oxidation of straight-chain or branched alkanes and alkenes;

d) oxidation of optionally substituted cycloalkanes and
cycloalkenes;

where the monooxygenase is derived from cytochrome P450
monooxygenase BM-3 from Bacillus megaterium having an amino
acid sequence according to SEQ ID NO: 2, which has at least
one functional mutation in at least one of the amino acid
sequence regions 172-224, 39-43, 48-52, 67-70, 330-335,
352-356, 73-82 and 86-88; except the single mutant Phe87Val.

2. A monooxygenase as claimed in claim 1, which has at least one
functional mutation in at least one of the sequence regions
73-82, 86-88 and 172-224.

3. A monooxygenase as claimed in claim 1, which has at least one
of the following mono- or polyamino acid substitutions:

a) Phe87Val, Leu188Gln; or

b) Phe87Val, Leu188Gln, Ala74Gly;

and functional equivalents thereof which are capable of at
least one of the above oxidation reactions.

4. A nucleic acid sequence coding for a monooxygenase according
to one of the preceding claims.

5. An expression construct comprising, under the genetic control
of regulatory nucleic acid sequences, a coding sequence which
comprises a nucleic acid sequence according to claim 4.

6. A vector comprising at least one expression construct
according to claim 5.

7. A recombinant microorganism transformed by at least one
vector as claimed in claim 6.
8. A microorganism as claimed in claim 7, selected from bacteria
of the genus Escherichia.

9. A process for the microbiological oxidation of an N-, O- or
S-heterocyclic mono- or polynuclear aromatic compound, which
comprises

a1) culturing a recombinant microorganism which expresses a
cytochrome P450 monooxygenase of bacterial origin in a
culture medium, in the presence of an exogenous or
intermediately formed substrate; or

a2) incubating a substrate-containing reaction medium with a
cytochrome P450 monooxygenase of bacterial origin; and

b) isolating the oxidation product formed or a secondary
product thereof from the medium.



10. A process as claimed in claim 9, wherein the exogenous or
intermediately formed substrate is selected from optionally
substituted N-, O- or S-heterocyclic mono- or polynuclear
aromatic compounds.

11. A process as claimed in claim 9 or 10, where the
monooxygenase is a mutant as claimed in any of claims 1 to 3
including the mutant Phe87Val.

12. A process as claimed in claim 11, where the mutant has at
least one of the following mono- or polyamino acid
substitutions:

a) Phe87Val;

b) Phe87Val, Leu188Gln; or

c) Phe87Val, Leu188Gln, Ala74Gly.

13. A process for microbiological oxidation of a compound as
defined in claim 1 b), c) or d), which comprises

a1) culturing a recombinant cytochrome P450-producing
microorganism as claimed in claim 7 or 8 in a culture medium,
in the presence of an exogenous or intermediately formed
substrate; or

a2) incubating a substrate-containing reaction medium with a
cytochrome P450 monooxygenase as claimed in any of claims 1
to 3; and

b) isolating the oxidation product formed or a secondary
product thereof from the medium;

where the monooxygenase mutant Phe87Val is not excluded.

14. A process as claimed in claim 13, wherein the exogenous or
intermediately formed substrate is selected from:

a) optionally substituted mono- or polynuclear aromatics;

b) straight-chain or branched alkanes and alkenes;

c) optionally substituted cycloalkanes and cycloalkenes.

15. A process as claimed in claim 13 or 14, where the
monooxygenase is a mutant as claimed in any of claims 1 to 3,
including the mutant Phe87Val.

16. A process as claimed in claim 15, where the mutant has at
least one of the following mono- or polyamino acid
substitutions:

a) Phe87Val;

b) Phe87Val, Leu188Gln; or

c) Phe87Val, Leu188Gln, Ala74Gly.

17. A process as claimed in any of claims 9 to 16, wherein, as
exogenous substrate, at least one compound selected from the
groups a) to d) of compounds defined above is added to a
medium and the oxidation is carried out by enzymatic reaction
of the substrate-containing medium in the presence of oxygen
at a temperature of approximately 20 to 40°C and a pH of
approximately 6 to 9, where the substrate-containing medium
additionally contains an approximately 10- to 100-fold molar
excess of reduction equivalents based on the substrate.



18. A process as claimed in claim 17, wherein, as exogenous
substrate, a compound selected from indole, n-hexane,
n-octane, n-decane, n-dodecane, cumene, 1-methylindole, α-, β-
or γ-ionone, acridine, naphthalene, 6-methyl- or
8-methylquinoline, quinoline and quinaldine is employed.

19. A process for the microbiological production of indigo and/or
indirubin, which comprises

a1) culturing a recombinant microorganism which produces an
indole-oxidizing cytochrome P450 in a culture medium, in the
presence of exogenous or intermediately formed indole; or

a2) incubating an indole-containing reaction medium with an
indole-oxidizing cytochrome P450 monooxygenase; and

b) isolating the oxidation product formed or a secondary
product thereof from the medium.

20. A process as claimed in claim 19, wherein the indigo and/or
indirubin obtained, which was produced by oxidation of
intermediately formed indole, is isolated from the medium.
21. A process as claimed in claim 20, wherein the indole
oxidation is carried out by culturing the microorganisms in
the presence of oxygen at a culturing temperature of
approximately 20 to 40°C and a pH of approximately 6 to 9.

22. A process as claimed in claim 20 or 21, where the
monooxygenase is a mutant as claimed in any of claims 1 to 3
including the mutant Phe87Val.

23. A process as claimed in claim 22, where the mutant has at
least one of the following mono- or polyamino acid
substitutions:

a) Phe87Val;

b) Phe87Val, Leu188Gln; or

c) Phe87Val, Leu188Gln, Ala74Gly.

24. A bioreactor comprising an enzyme as claimed in one of claims
1 to 3 or a recombinant microorganism as claimed in one of
claims 7 or 8 in immobilized form.

25. The use of a cytochrome P450 monooxygenase as claimed in one
of claims 1 to 3, of a vector as claimed in claim 6, or of a
microorganism as claimed in claim 7 or 8 for the
microbiological oxidation of

a) optionally substituted N-, O- or S-heterocyclic mono- or
polynuclear aromatic compounds;

b) optionally substituted mono- or polynuclear aromatics;

c) straight-chain or branched alkanes and alkenes; and/or

d) optionally substituted cycloalkanes and cycloalkenes,

where the monooxygenase mutant Phe87Val is not excluded.

26. The use of a microorganism producing indole-oxidizing
cytochrome P450 for the preparation of indigo and/or
indirubin.
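
The substitution labels used in the claims (for example Phe87Val) name the original residue, its position in SEQ ID NO: 2, and the replacement residue. The following is a minimal illustrative sketch, not part of the claimed subject matter and not a legal interpretation, of how such labels could be parsed and compared with the sequence regions listed in claim 1. The function names and the dictionary layout are hypothetical and are chosen only for this illustration.

```python
import re

# Sequence regions listed in claim 1 (1-based, inclusive residue positions).
CLAIMED_REGIONS = [(172, 224), (39, 43), (48, 52), (67, 70),
                   (330, 335), (352, 356), (73, 82), (86, 88)]

# Three-letter code, position, three-letter code, e.g. "Phe87Val".
MUTATION_RE = re.compile(r"^([A-Z][a-z]{2})(\d+)([A-Z][a-z]{2})$")


def parse_mutation(label):
    """Split a label such as 'Phe87Val' into (original, position, replacement)."""
    match = MUTATION_RE.match(label.strip())
    if match is None:
        raise ValueError(f"unrecognized mutation label: {label!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def in_claimed_region(position):
    """Return True if the position lies in one of the regions of claim 1."""
    return any(low <= position <= high for low, high in CLAIMED_REGIONS)


def summarize(labels):
    """Summarize a set of substitutions (hypothetical helper for illustration)."""
    parsed = [parse_mutation(label) for label in labels]
    positions = [position for _, position, _ in parsed]
    return {
        "positions": positions,
        "all_in_claimed_regions": all(in_claimed_region(p) for p in positions),
        # Claim 1 excludes only the single mutant Phe87Val.
        "is_single_mutant_phe87val": [l.strip() for l in labels] == ["Phe87Val"],
    }


if __name__ == "__main__":
    print(summarize(["Phe87Val", "Leu188Gln", "Ala74Gly"]))
```

In this sketch the triple mutant named in claim 3 b) maps to positions 87, 188 and 74, which fall in the regions 86-88, 172-224 and 73-82 respectively.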


